202 research outputs found

    LMC Self-lensing from a new perspective

    Get PDF
    We present a new analysis on the issue of the location of the observed microlensing events in direction of the Large Magellanic Cloud (LMC). This is carried out starting from a recently drawn coherent picture of the geometrical structure and dynamics of the LMC disk and by considering different configurations for the LMC bar. In this framework it clearly emerges that the spatial distribution of the events observed so far shows a near--far asymmetry. This turns out to be compatible with the optical depth calculated for the LMC halo objects. In this perspective, our main conclusion, supported by a statistical analysis on the outcome of an evaluation of the microlensing rate, is that self lensing can not account for all the observed events. Finally we propose a general inequality to calculate quickly an upper limit to the optical depth along a line of view through the LMC center.Comment: revised version (minor changes) Accepted for publication in A&

    From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

    Get PDF
    ©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein

    Biophysical and electrochemical studies of protein-nucleic acid interactions

    Get PDF
    This review is devoted to biophysical and electrochemical methods used for studying protein-nucleic acid (NA) interactions. The importance of NA structure and protein-NA recognition for essential cellular processes, such as replication or transcription, is discussed to provide background for description of a range of biophysical chemistry methods that are applied to study a wide scope of protein-DNA and protein-RNA complexes. These techniques employ different detection principles with specific advantages and limitations and are often combined as mutually complementary approaches to provide a complete description of the interactions. Electrochemical methods have proven to be of great utility in such studies because they provide sensitive measurements and can be combined with other approaches that facilitate the protein-NA interactions. Recent applications of electrochemical methods in studies of protein-NA interactions are discussed in detail

    A reexamination of information theory-based methods for DNA-binding site identification

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Searching for transcription factor binding sites in genome sequences is still an open problem in bioinformatics. Despite substantial progress, search methods based on information theory remain a standard in the field, even though the full validity of their underlying assumptions has only been tested in artificial settings. Here we use newly available data on transcription factors from different bacterial genomes to make a more thorough assessment of information theory-based search methods.</p> <p>Results</p> <p>Our results reveal that conventional benchmarking against artificial sequence data leads frequently to overestimation of search efficiency. In addition, we find that sequence information by itself is often inadequate and therefore must be complemented by other cues, such as curvature, in real genomes. Furthermore, results on skewed genomes show that methods integrating skew information, such as <it>Relative Entropy</it>, are not effective because their assumptions may not hold in real genomes. The evidence suggests that binding sites tend to evolve towards genomic skew, rather than against it, and to maintain their information content through increased conservation. Based on these results, we identify several misconceptions on information theory as applied to binding sites, such as negative entropy, and we propose a revised paradigm to explain the observed results.</p> <p>Conclusion</p> <p>We conclude that, among information theory-based methods, the most unassuming search methods perform, on average, better than any other alternatives, since heuristic corrections to these methods are prone to fail when working on real data. A reexamination of information content in binding sites reveals that information content is a compound measure of search and binding affinity requirements, a fact that has important repercussions for our understanding of binding site evolution.</p

    Single-Molecule Analysis Reveals the Kinetics and Physiological Relevance of MutL-ssDNA Binding

    Get PDF
    DNA binding by MutL homologs (MLH/PMS) during mismatch repair (MMR) has been considered based on biochemical and genetic studies. Bulk studies with MutL and its yeast homologs Mlh1-Pms1 have suggested an integral role for a single-stranded DNA (ssDNA) binding activity during MMR. We have developed single-molecule Förster resonance energy transfer (smFRET) and a single-molecule DNA flow-extension assays to examine MutL interaction with ssDNA in real time. The smFRET assay allowed us to observe MutL-ssDNA association and dissociation. We determined that MutL-ssDNA binding required ATP and was the greatest at ionic strength below 25 mM (KD = 29 nM) while it dramatically decreases above 100 mM (KD>2 µM). Single-molecule DNA flow-extension analysis suggests that multiple MutL proteins may bind ssDNA at low ionic strength but this activity does not enhance stability at elevated ionic strengths. These studies are consistent with the conclusion that a stable MutL-ssDNA interaction is unlikely to occur at physiological salt eliminating a number of MMR models. However, the activity may infer some related dynamic DNA transaction process during MMR

    Evolutionary tradeoffs in cellular composition across diverse bacteria

    Get PDF
    One of the most important classic and contemporary interests in biology is the connection between cellular composition and physiological function. Decades of research have allowed us to understand the detailed relationship between various cellular components and processes for individual species, and have uncovered common functionality across diverse species. However, there still remains the need for frameworks that can mechanistically predict the tradeoffs between cellular functions and elucidate and interpret average trends across species. Here we provide a comprehensive analysis of how cellular composition changes across the diversity of bacteria as connected with physiological function and metabolism, spanning five orders of magnitude in body size. We present an analysis of the trends with cell volume that covers shifts in genomic, protein, cellular envelope, RNA and ribosomal content. We show that trends in protein content are more complex than a simple proportionality with the overall genome size, and that the number of ribosomes is simply explained by cross-species shifts in biosynthesis requirements. Furthermore, we show that the largest and smallest bacteria are limited by physical space requirements. At the lower end of size, cell volume is dominated by DNA and protein content—the requirement for which predicts a lower limit on cell size that is in good agreement with the smallest observed bacteria. At the upper end of bacterial size, we have identified a point at which the number of ribosomes required for biosynthesis exceeds available cell volume. Between these limits we are able to discuss systematic and dramatic shifts in cellular composition. Much of our analysis is connected with the basic energetics of cells where we show that the scaling of metabolic rate is surprisingly superlinear with all cellular components

    Inferring Binding Energies from Selected Binding Sites

    Get PDF
    We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms

    The Influence of Transcription Factor Competition on the Relationship between Occupancy and Affinity

    Get PDF
    Transcription factors (TFs) are proteins that bind to specific sites on the DNA and regulate gene activity. Identifying where TF molecules bind and how much time they spend on their target sites is key to understanding transcriptional regulation. It is usually assumed that the free energy of binding of a TF to the DNA (the affinity of the site) is highly correlated to the amount of time the TF remains bound (the occupancy of the site). However, knowing the binding energy is not sufficient to infer actual binding site occupancy. This mismatch between the occupancy predicted by the affinity and the observed occupancy may be caused by various factors, such as TF abundance, competition between TFs or the arrangement of the sites on the DNA. We investigated the relationship between the affinity of a TF for a set of binding sites and their occupancy. In particular, we considered the case of the transcription factor lac repressor (lacI) in E.coli, and performed stochastic simulations of the TF dynamics on the DNA for various combinations of lacI abundance and competing TFs that contribute to macromolecular crowding. We also investigated the relationship of site occupancy and the information content of position weight matrices (PWMs) used to represent binding sites. Our results showed that for medium and high affinity sites, TF competition does not play a significant role for genomic occupancy except in cases when the abundance of the TF is significantly increased, or when the PWM displays relatively low information content. Nevertheless, for medium and low affinity sites, an increase in TF abundance (for both cognate and non-cognate molecules) leads to an increase in occupancy at several sites. © 2013 Zabet et al

    Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo

    Get PDF
    DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity

    E. coli metabolic protein aldehydealcohol dehydrogenase-E binds to the ribosome: a unique moonlighting action revealed

    Get PDF
    It is becoming increasingly evident that a high degree of regulation is involved in the protein synthesis machinery entailing more interacting regulatory factors. A multitude of proteins have been identified recently which show regulatory function upon binding to the ribosome. Here, we identify tight association of a metabolic protein aldehyde-alcohol dehydrogenase E (AdhE) with the E. coli 70S ribosome isolated from cell extract under low salt wash conditions. Cryo-EM reconstruction of the ribosome sample allows us to localize its position on the head of the small subunit, near the mRNA entrance. Our study demonstrates substantial RNA unwinding activity of AdhE which can account for the ability of ribosome to translate through downstream of at least certain mRNA helices. Thus far, in E. coli, no ribosome-associated factor has been identified that shows downstream mRNA helicase activity. Additionally, the cryo-EM map reveals interaction of another extracellular protein, outer membrane protein C (OmpC), with the ribosome at the peripheral solvent side of the 50S subunit. Our result also provides important insight into plausible functional role of OmpC upon ribosome binding. Visualization of the ribosome purified directly from the cell lysate unveils for the first time interactions of additional regulatory proteins with the ribosom
    corecore